For any physical observable in statistical systems, the most frequently studied quantities are its
average and standard deviation. Yet, its full distribution often carries extremely interesting informa- 
tion and can be invoked to put any surprising properties of the individual moments into perspective. 
As an example, we consider a problem concerning simple random walks which was posed in a recent 
text. When a drunk is observed over L nights, taking N steps per night, and the number of steps to 
the right is recorded for each night, an average and a variance based on these data can be computed. 
When the variance is used to estimate p, the probability for the drunk to step right, complex values 
for p are frequently found. To put such obviously nonsensical results into context, we study the full 
probability distribution for the variance of the data string. We discuss the connection of our results 
to the problem of data binning and provide two other brief examples to demonstrate the importance 
of full distributions. 



I. INTRODUCTION 

Many properties of the random walk are well-known^1
and form an important part of good texts in statistical
mechanics, at either the undergraduate or graduate
level^2. Despite its "age,"^3 this problem continues to
present new and interesting puzzles, depending on the
questions asked of the walker. Many recent examples of
such puzzles, some of which remain unsolved, involve the
issue of full distributions (as opposed to just the averages)
of certain quantities. Providing an exhaustive list is beyond
the scope of this paper. Instead, in the concluding
section, we discuss several examples to illustrate both
the value and the excitement in such studies. Our main
interest lies in a distribution rarely discussed in texts,
in connection with a problem posed in the manuscript
of an undergraduate textbook on statistical mechanics,
co-authored by H. Gould and J. Tobochnik (GT)^4.

Let us first quote the question which motivated this 
paper, namely, the first two parts of problem 3.37 in this 
manuscript. 

A random walker is observed to take a total 
of N steps, n of which are to the right. 

(a) Suppose that a curious observer finds 
that on ten successive nights the walker takes 
N = 20 steps and that the values of n are 
given successively by 14, 13, 11, 12, 11, 12, 
16, 16, 14, 8. Compute $\bar{n}$, $\overline{n^2}$, and $\sigma_n$. Use
this information to estimate p. If your reason- 
ing gives different values for p, which estimate 
is likely to be the most accurate? 

(b) Suppose that on another ten successive 
nights the same walker takes N = 100 steps
and that the values of n are given by 58, 69, 
71, 58, 63, 53, 64, 66, 65, 50. Compute the 
same quantities as in part (a) and estimate p. 
... Explain your results. 



To help the reader with context and notation, we add
that this problem is at the end of a section about a simple
random walk in one dimension, stepping either to
the right or left, with probability $p$ and $1 - p$, respectively.
The standard results for the average number of
right steps, $Np$, and the associated variance, $Np(1-p)$,
were derived. We will refer to these expressions as the
"true average" and the "true variance," denoted by

$$n_{av} = Np \quad \text{and} \quad v = Np(1-p) \tag{1}$$

respectively. They would result if we were to observe the
walker for very (ideally, infinitely) many nights.

In the problem, however, $\bar{n}$ and $\sigma_n$ denote the average
and the standard deviation (i.e., $\sqrt{\overline{n^2} - \bar{n}^2}$) computed
from observations covering only a relatively small number
(ten) of nights. So, estimates for $p$ are to be made from
either the equation for the average

$$\bar{n} = N p_{av} \tag{2}$$

or the one for the standard deviation

$$\sigma_n = \sqrt{N p_\sigma (1 - p_\sigma)} \, . \tag{3}$$

The authors pointed out^4 that, since the second method
involves a quadratic equation for $p$, there will be two
solutions (indeed, symmetric around $p = 1/2$), so that
this route cannot produce a unique answer by itself.

More interestingly, if straightforward computations are
carried out, it is even more trivial to answer the question
concerning which estimate is more "accurate": $p_\sigma$ is
complex for both (a) and (b)! When we trace the origins of
this remarkable result, we find that, though the true variance
$v$ never exceeds $N/4$, the variance computed from a particular
data string can easily exceed this bound. One of the "worst
case" scenarios - focusing again on 10 nights - occurs
when the drunk takes $N$ steps to the right for $10p$ nights
and $N$ steps to the left for $10(1-p)$ nights. For example,
instead of the first string given in part (a) of the problem,
we have (assuming $p = 0.6$) 20, 20, 0, 0, 20, 0, 20, 20, 0, 20.
Though such a string even provides the exact underlying
$p$, it leads to $\sigma^2_{max} = N^2 p(1-p)$. Note that $\sigma^2_{max}$
even comes with the wrong power of $N$, so that, for sufficiently
large $N$, it will always exceed the largest possible
true variance, $\max_{p \in [0,1]} Np(1-p) = N/4$, regardless of
the underlying $p$! At the other extreme, if the same $n$
is observed on every night, then the absolute minimum
$\sigma_{min} = 0$ is achieved. Though this $\sigma_{min}$ leads to a real
$p_\sigma$, the result ($p_\sigma = 0$ or 1) is clearly also unreasonable.
Although we can easily compute the expectation values
of $\sigma_n$ and $p_\sigma$, a natural question arises: how likely is it
that $\sigma_n^2$ exceeds the absolute bound of $N/4$? Similarly,
we could ask for the likelihood that $p_\sigma$ comes within,
say, 5% of the underlying $p$. Thus, we are led to study
the full distribution of variances in the observations of a
drunkard.
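The arithmetic behind this observation can be checked with a few lines of Python. The sketch below is only an illustration (the function name `estimate_p` is ours and not part of the textbook problem): it applies Eqn (2) and Eqn (3) to the two data strings quoted above and uses `cmath` so that a negative discriminant shows up as a complex root rather than as an error.

```python
import cmath

def estimate_p(counts, n_steps):
    """Return (p_av, variance, roots) for one string of nightly right-step counts."""
    n_nights = len(counts)
    mean = sum(counts) / n_nights
    mean_sq = sum(n * n for n in counts) / n_nights
    variance = mean_sq - mean ** 2            # sigma_n^2 as defined in the text
    p_av = mean / n_steps                     # estimate from Eqn (2)
    # Eqn (3): N p (1 - p) = variance  =>  p = [1 +/- sqrt(1 - 4 variance / N)] / 2
    root = cmath.sqrt(1 - 4 * variance / n_steps)
    return p_av, variance, ((1 + root) / 2, (1 - root) / 2)

part_a = [14, 13, 11, 12, 11, 12, 16, 16, 14, 8]      # N = 20
part_b = [58, 69, 71, 58, 63, 53, 64, 66, 65, 50]     # N = 100

for label, counts, n_steps in (("a", part_a, 20), ("b", part_b, 100)):
    p_av, variance, roots = estimate_p(counts, n_steps)
    print(label, "p_av =", round(p_av, 4), "variance =", round(variance, 4),
          "bound N/4 =", n_steps / 4, "p_sigma roots =", roots)
```

In both cases the computed variance lies above $N/4$, so both roots acquire a nonzero imaginary part.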

The rest of this paper is organized as follows. The next 
section is devoted to the detailed analysis of the issues at 
hand. Section III is provided as a convenience, for those 
who wish to skip the details for now, read a summary of 
our conclusions, and see how they apply to the specific 
problem here. In a final section, we make brief sugges- 
tions for expanding discussions of distributions in typical 
courses on statistical mechanics. 



Needless to say, $0 \le \bar{n}_\alpha \le N$ and $0 \le V_\alpha \le N^2/4$. (Note
the $N^2$ in the last bound, which comes from the worst
case scenario with only right steps for exactly half the
nights!) Clearly, $\bar{n}_\alpha$ and $V_\alpha$ are still random variables,
with respect to the ensemble of drunks. If we make histograms
from the two sets of $M$ numbers, denoted by
$H(\bar{n})$ and $H(V)$, we get a glimpse of the full distribution
of their possible values. The first is easy, being related to
the binomial distribution, and approaches a Gaussian for
large $N$. Nevertheless, there are some subtleties to which
we will alert the reader. The second will give us an idea
of how often we can expect a data string to produce a
variance which exceeds the maximum possible value, for
any realizable $p$, namely, $N/4$.



A. Statistics of averages 

For completeness, let us remind the reader that, if we 
focus on the statistics of a single drunkard taking N steps 
on a single night, then the probability that he takes n 
right steps is just the binomial distribution: 



$$P(n) = \binom{N}{n} p^n (1-p)^{N-n} . \tag{6}$$



II. DISTRIBUTIONS OF AVERAGES AND 
VARIANCES 

To motivate the study of distributions, let us use the 
language of human behaviorists to define an "ensemble" 
of many identical drunkards (say, M) residing in different 
cities. The probability of stepping to the right is p, for all 
drunkards in all nights. Each is observed for L nights, 
N total steps each night. The number of steps to the 
right, n, differs each night and for different walkers. So, 
the entire data set can be summarized by M strings of L 
numbers: 



$$n_{\alpha,i} \, , \quad \alpha \in [1, M] ; \; i \in [1, L] \quad \text{with} \quad 0 \le n_{\alpha,i} \le N . \tag{4}$$



Alternatively, we can imagine the string of numbers in 
GT's text being generated randomly (with the same p) 
each time the URL is accessed. After M readings, we 
would have M different strings. So, for the two cases 
stated in the problem above, M is just 1 and L = 10, 
while N = 20 and 100.

From this big data set, we can construct M averages 
and variances, each generated from the data set for one 
of the drunks: 



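$$\bar{n}_\alpha = \frac{1}{L} \sum_{i=1}^{L} n_{\alpha,i} \quad \text{and} \quad V_\alpha = \frac{1}{L} \sum_{i=1}^{L} n_{\alpha,i}^2 - \left( \bar{n}_\alpha \right)^2 . \tag{5}$$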



If he is observed for $L$ nights (for a total of $LN$ steps),
then the probability that he took a total of $m$ steps to the
right is clearly also a binomial: $\binom{LN}{m} p^m (1-p)^{LN-m}$.
But notice that $\bar{n}$, the average over the $L$ nights of right
steps, is precisely $m/L$. Thus, we may conclude immediately
that the probability distribution for the average
number of right steps is

$$\mathcal{P}(\bar{n}) = \binom{LN}{L\bar{n}} p^{L\bar{n}} (1-p)^{L(N-\bar{n})} . \tag{7}$$



If we take data for $M$ drunkards and compile a histogram
for the set of $\bar{n}_\alpha$, we should find

$$H(\bar{n}) \to M \mathcal{P}(\bar{n}) \tag{8}$$

as $M \to \infty$.

This distribution also tells us how reliable our estimate
for $p$ is when we have data for only one drunkard (as
posed in the text problem). In particular, as we can guess
intuitively, although we are most likely to get the right
$p$ by just dividing $\bar{n}$ by $N$ (i.e., $\mathcal{P}(\bar{n})$ peaks at $Np$), the
chances we are off can be estimated through the standard
deviation of $\mathcal{P}(\bar{n})$, namely, $\sqrt{Np(1-p)/L}$.
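As a quick check of this statement, the following sketch tabulates the exact binomial distribution of the nightly average, Eqn (7), and computes its mean and standard deviation directly. The parameter values ($N = 20$, $L = 10$, $p = 0.6$) are chosen here for illustration only, and the function name is hypothetical.

```python
from math import comb, sqrt

def average_distribution(n_steps, n_nights, p):
    """Exact distribution of the nightly average n_bar = m / L, cf. Eqn (7)."""
    total = n_steps * n_nights
    return {m / n_nights: comb(total, m) * p**m * (1 - p) ** (total - m)
            for m in range(total + 1)}

N, L, p = 20, 10, 0.6
dist = average_distribution(N, L, p)
mean = sum(x * w for x, w in dist.items())
variance = sum((x - mean) ** 2 * w for x, w in dist.items())

# The mean should equal N p, and the spread should equal sqrt(N p (1 - p) / L).
print(mean, sqrt(variance), sqrt(N * p * (1 - p) / L))
```

Dividing the spread by $N$ gives the corresponding uncertainty of the estimate $\bar{n}/N$ for $p$.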

Finally, if $N$ is large, the binomial $P(n)$ is indistinguishable
from a Gaussian:

$$P(n) = \frac{1}{\sqrt{2\pi v}} \exp\left[ -\frac{(n - n_{av})^2}{2v} \right] . \tag{9}$$



Since convolutions of Gaussians form a Gaussian, the distribution
of the averages is also a Gaussian, with variance
$v/L$. In other words, we should find

$$H(\bar{n}) \to M \mathcal{P}(\bar{n}; n_{av}, v, L) = M \sqrt{\frac{L}{2\pi v}} \, e^{-L(\bar{n} - n_{av})^2/2v} . \tag{10}$$

Note that we have included an explicit list of the parameters
which control the distribution $\mathcal{P}$.



B. Statistics of variances 

Next, we turn to the central issue of this paper. If $p$
is estimated through the variance of a single string of $L$
observations $\{n_i\}$, how often can we expect the estimate
to be complex? In other words, what is the probability
that the variance $L^{-1} \sum_i n_i^2 - \bar{n}^2$ exceeds the absolute
bound

$$V_{abs} \equiv N/4 \; ? \tag{11}$$



To answer this question, we focus on the probability that
$L^{-1} \sum_i n_i^2 - \bar{n}^2$ assumes a given value, say, $V$. First, we
ask for the (joint) probability that the number of right
steps, observed over $L$ nights, takes the values $n_1, n_2, \ldots, n_L$.
Since the events of any night are independent of
all other nights, this joint probability $P(n_1, n_2, \ldots, n_L)$ is
just the product of the probabilities for a single night, i.e.,
$\prod_{i=1}^{L} P(n_i)$, and $P(n_i)$ is simply given by the binomial,
Eqn (6). Summing over all possible outcomes $\{n_i\} =
\{n_1, n_2, \ldots, n_L\}$ by using a Kronecker delta to count only
those for which the variance equals $V$, we obtain

$$\mathcal{P}(V) = \sum_{\{n_i\}} \delta\!\left( V , \; \frac{1}{L} \sum_{i=1}^{L} n_i^2 - \Big[ \frac{1}{L} \sum_{i=1}^{L} n_i \Big]^2 \right) \prod_{i=1}^{L} P(n_i) . \tag{12}$$
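Although the sum in Eqn (12) has no simple closed form, it can be evaluated by brute force when $L$ is small. The sketch below does this for $L = 3$ nights of $N = 20$ steps (values chosen by us purely to keep the enumeration short, about $21^3$ terms); the function name is hypothetical. Integer keys are used so that equal variances are grouped reliably.

```python
from itertools import product
from math import comb

def exact_variance_distribution(n_steps, n_nights, p):
    """Brute-force evaluation of Eqn (12) for binomial nightly counts."""
    single = [comb(n_steps, n) * p**n * (1 - p) ** (n_steps - n)
              for n in range(n_steps + 1)]
    weights = {}
    for counts in product(range(n_steps + 1), repeat=n_nights):
        weight = 1.0
        for n in counts:
            weight *= single[n]
        # L^2 * V = L * sum(n^2) - (sum n)^2 is an integer, so identical
        # variances always land on the same dictionary key.
        key = n_nights * sum(n * n for n in counts) - sum(counts) ** 2
        weights[key] = weights.get(key, 0.0) + weight
    return {key / n_nights**2: w for key, w in weights.items()}

N, L, p = 20, 3, 0.6
dist = exact_variance_distribution(N, L, p)
mean_V = sum(V * w for V, w in dist.items())
prob_exceed = sum(w for V, w in dist.items() if V > N / 4)
print("mean variance:", mean_V, " prob(V > N/4):", prob_exceed)
```

The mean of this exact distribution equals $Np(1-p)(1 - 1/L)$, which holds for any $N$ and does not rely on the Gaussian approximation.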

Unfortunately, we are unable to evaluate this expression
exactly. However, an excellent approximation can be
obtained if $N$ is not too small (as in the cases posed
in Ref. 4) so that we can use the Gaussian approximation,
Eqn (9), instead of the exact binomial, Eqn (6). A
reader intimately familiar with error analysis will recognize
our quest as the probability density function for the
$\chi^2$-distribution^5. For pedagogical purposes, let us show
how to make progress from this point. Let the probability
that the variance lies in the interval $[V, V + dV]$ be
$\mathcal{P}(V)\,dV$. Then,

$$\mathcal{P}(V; v, L) = \int_{-\infty}^{\infty} \prod_{i=1}^{L} dn_i \, P(n_i) \times \delta\!\left( V - \left[ \frac{1}{L} \sum_{i=1}^{L} n_i^2 - \Big( \frac{1}{L} \sum_{i=1}^{L} n_i \Big)^2 \right] \right) \tag{13}$$



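where we have included the relevant parameters $(v, L)$
explicitly. Note that all variables are now continuous
with infinite range and, correspondingly, the $\delta$ here is
the Dirac delta. To evaluate this distribution, consider
its Laplace transform

$$\mathcal{L}(\mu) \equiv \int_0^{\infty} dV \, e^{-\mu V} \mathcal{P}(V; v, L) = \int_{-\infty}^{\infty} \prod_{i=1}^{L} dn_i \, P(n_i) \times \exp\left\{ -\mu \left[ \frac{1}{L} \sum_{i=1}^{L} n_i^2 - \Big( \frac{1}{L} \sum_{i=1}^{L} n_i \Big)^2 \right] \right\} . \tag{14}$$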



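Since each $P$ is a Gaussian, we have here a generalized
Gaussian integral. Explicitly, we have

$$\mathcal{L}(\mu) = \int_{-\infty}^{\infty} \prod_{i=1}^{L} \frac{dn_i}{\sqrt{2\pi v}} \exp\left\{ -\sum_{i=1}^{L} \frac{(n_i - Np)^2}{2v} - \frac{\mu}{L} \sum_{i=1}^{L} n_i^2 + \frac{\mu}{L^2} \Big( \sum_{i=1}^{L} n_i \Big)^2 \right\} . \tag{15}$$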

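To proceed, it is best to displace each $n_i$ by $Np$. Verifying
that the variance is independent of such a shift, we have
a simpler form:

$$\mathcal{L}(\mu) = \int_{-\infty}^{\infty} \prod_{i=1}^{L} \frac{dn_i}{\sqrt{2\pi v}} \exp\left\{ -\left( \frac{1}{2v} + \frac{\mu}{L} \right) \sum_{i=1}^{L} n_i^2 + \frac{\mu}{L^2} \Big( \sum_{i=1}^{L} n_i \Big)^2 \right\} . \tag{16}$$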



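Rescaling each $n_i$ by $\sqrt{vL/(L + 2\mu v)}$ simplifies this expression
further:

$$\mathcal{L}(\mu) = \left( \frac{L}{L + 2\mu v} \right)^{L/2} \int_{-\infty}^{\infty} \prod_{i=1}^{L} \frac{dn_i}{\sqrt{2\pi}} \exp\left\{ -\frac{1}{2} \left[ \sum_{i=1}^{L} n_i^2 - \frac{2\mu v}{L + 2\mu v} \Big( \frac{1}{\sqrt{L}} \sum_{i=1}^{L} n_i \Big)^2 \right] \right\} . \tag{17}$$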

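The exponent should be regarded as a quadratic form

$$-\frac{1}{2} \sum_{i,j=1}^{L} n_i \, G_{ij} \, n_j \tag{18}$$

where the matrix $G$ is just an identity matrix plus a term
proportional to the tensor product of a single unit vector,
namely, $(1, 1, \ldots, 1)/\sqrt{L}$, with itself. The eigenvalues of
such a matrix are all unity, except for one, which is unity
plus the value of this proportionality constant. In Eqn (17),
we have purposefully written the last term in the exponent
to display this constant, i.e., $-2\mu v/(L + 2\mu v)$. Exploiting

$$\int_{-\infty}^{\infty} \prod_{i=1}^{L} \frac{dn_i}{\sqrt{2\pi}} \exp\left[ -\frac{1}{2} \sum_{i,j=1}^{L} n_i \, G_{ij} \, n_j \right] = \left( \det G \right)^{-1/2} , \tag{19}$$

we have

$$\mathcal{L}(\mu) = \left( \frac{L}{L + 2\mu v} \right)^{L/2} \left[ 1 - \frac{2\mu v}{L + 2\mu v} \right]^{-1/2} = \left( \frac{L}{L + 2\mu v} \right)^{(L-1)/2} . \tag{20}$$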



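To obtain $\mathcal{P}(V; v, L)$, we only need to perform an inverse
Laplace transform. Fortunately, $\mathcal{L}(\mu)$ is simple enough
to appear in standard tables^6 and the explicit answer is

$$\mathcal{P}(V; v, L) = \frac{1}{V \, \Gamma\!\left( \frac{L-1}{2} \right)} \left( \frac{LV}{2v} \right)^{(L-1)/2} e^{-LV/2v} . \tag{21}$$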
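Readers who prefer a numerical check to a table lookup can verify that the density of Eqn (21) is normalized and that its Laplace transform reproduces the closed form $[L/(L + 2\mu v)]^{(L-1)/2}$. The sketch below uses a hand-written composite Simpson rule; the parameter values ($v = 4.8$, $L = 10$, $\mu = 0.3$), the integration cutoff, and the helper names are our own illustrative choices.

```python
from math import exp, gamma

def variance_density(V, v, n_nights):
    """Eqn (21): density of the sample variance V for true variance v."""
    g = (n_nights - 1) / 2
    x = n_nights * V / (2 * v)
    return x**g * exp(-x) / (V * gamma(g))

def simpson(f, a, b, n=20000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

v, L, mu = 4.8, 10, 0.3      # v = N p (1 - p) with N = 20, p = 0.6
lower = 1e-12                # avoid the 0/0 at V = 0; the density vanishes there for L > 3
upper = 60 * v               # the integrand is negligible beyond this cutoff

norm = simpson(lambda V: variance_density(V, v, L), lower, upper)
numeric = simpson(lambda V: exp(-mu * V) * variance_density(V, v, L), lower, upper)
closed_form = (L / (L + 2 * mu * v)) ** ((L - 1) / 2)
print(norm, numeric, closed_form)
```

The simple quadrature is adequate here because the integrand is smooth for $L > 3$; for $L = 2$ the integrable divergence at the origin would call for a change of variable first.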



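Of course, this distribution may be written in scaling
form:

$$\mathcal{P}(V; v, L) \, dV = \Phi_\gamma(x) \, dx \tag{22}$$

where the (non-negative) scaling variable is

$$x \equiv \frac{LV}{2v} \tag{23}$$

and $\Phi_\gamma$ is the standard gamma distribution^7:

$$\Phi_\gamma(x) \equiv \frac{x^{\gamma - 1} e^{-x}}{\Gamma(\gamma)} \tag{24}$$

with $\gamma \equiv (L-1)/2$ for our case. We briefly note some of its
properties: It peaks at $x_{peak} = \max\{0, \gamma - 1\}$ and has moments
$\langle x^n \rangle = \Gamma(n + \gamma)/\Gamma(\gamma)$. Thus, its average and variance both equal
$\gamma$: $\langle x \rangle = \langle x^2 \rangle - \langle x \rangle^2 = \gamma$.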

Returning to the problem we posed for the human behaviorist,
namely, to compile a histogram for the set of
$V_\alpha$, Eqn (5), on the basis of data for $M$ drunkards, Eqn
(4), we should find

$$H(V) \to M \mathcal{P}(V; v, L) \tag{25}$$

for very large $M$.

Before applying this result to the problem quoted
above, let us comment on several interesting features.
From Eqn (23), we might infer that the relevant scale
for $V$ is $v/L = Np(1-p)/L$. Nonetheless, the average
(or expectation value) of $V$, being

$$V_{av} = \frac{2v}{L} \gamma = Np(1-p) \left[ 1 - \frac{1}{L} \right] \tag{26}$$

is much closer to $v$, especially for large $L$. Another interesting
quantity is the most likely value for $V$, which
is simply related to the peak value of $\Phi_\gamma$, via $V_{peak} =
(2v/L) \, x_{peak}$. So,

$$V_{peak} = \begin{cases} 0 & \text{for } L \le 3 \\ Np(1-p) \left[ 1 - \dfrac{3}{L} \right] & \text{for } L \ge 3 \end{cases} \tag{27}$$

Both are consistently less than $Np(1-p)$, the "true variance".
Though both approach $v$ monotonically as $L$ increases,
these results provide a measure of how much the
variance of a short data string (i.e., small $L$) can differ
from its asymptotic ($L \to \infty$) value.

Since short data strings lead to the most serious discrepancies,
let us make a detour for the cases $L = 1$, 2,
and 3. The appearance of $\Gamma\!\left( \frac{L-1}{2} \right)$ in the denominator
of Eqn (21) ensures that, for $L = 1$, $\mathcal{P} = 0$ for $V \neq 0$.
On the other hand, this distribution is normalized, so
that we may conclude $\mathcal{P}(V; v, 1) = \delta(V)$, regardless of
$v$. This result is perfectly understandable, since $L = 1$
corresponds to our observing the drunkard for only a single
night. If a "data set" consists of only a single number,
the "variance" is necessarily zero, regardless of the behavior
of the drunkard! For $L = 2$, $\mathcal{P}$ diverges at the origin.
There is no cause for alarm, however, as the divergence is
weak enough for $\int \mathcal{P} \, dV$ to be finite in any neighborhood
of $V = 0$. Indeed, a better perspective is provided by the
distribution for the standard deviation

$$\sigma \equiv \sqrt{V} .$$

Then, we find a familiar looking expression:

$$P(\sigma; v, 2) = \frac{dV}{d\sigma} \, \mathcal{P}(V; v, 2) = \frac{2}{\sqrt{\pi v}} \, e^{-\sigma^2/v} .$$

However, keep in mind that we have only "half a Gaussian"
here, since $\sigma \in [0, \infty)$! Finally, the other curious
case is $L = 3$ where $\mathcal{P}(V; v, 3)$ is a pure exponential, for
which the standard deviation assumes the same value as
the average. By contrast, the situation for large $L$ provides
few surprises. The gamma distribution approaches
a Gaussian with width $O(1/\sqrt{L})$.



III. SUMMARY AND APPLICATION TO THE
GOULD-TOBOCHNIK^4 PROBLEM

Let us summarize our findings before applying them to
the problem quoted in the Introduction. For the reader's
convenience, we reiterate the set-up: Consider an "ensemble"
of $M$ identical drunkards with $p$ as the probability
of stepping to the right. Each is observed for $L$
nights, taking $N$ total steps each night. For drunkard
$\alpha$ ($= 1, \ldots, M$), the number of right steps is recorded for
each night ($i = 1, \ldots, L$) and denoted by $n_{\alpha,i}$. From these,
we compute the averages $\bar{n}_\alpha = \frac{1}{L} \sum_i n_{\alpha,i}$ and variances
$V_\alpha = \frac{1}{L} \sum_i n_{\alpha,i}^2 - (\bar{n}_\alpha)^2$. Normalized histograms
for these are compiled and denoted by $H(\bar{n})/M$ and
$H(V)/M$. We find that, already for moderate $N$ (such
as 20), the first will approach a Gaussian distribution
while the second, a gamma distribution. Explicitly,

$$\frac{H(\bar{n})}{M} \xrightarrow[M \to \infty]{} \mathcal{P}(\bar{n}; n_{av}, v, L) = \sqrt{\frac{L}{2\pi v}} \, e^{-L(\bar{n} - n_{av})^2/2v} \tag{28}$$

and

$$\frac{H(V)}{M} \xrightarrow[M \to \infty]{} \mathcal{P}(V; v, L) = \frac{x^{\gamma} e^{-x}}{V \, \Gamma(\gamma)} \tag{29}$$

where $n_{av} = Np$, $v = Np(1-p)$, $x = LV/2v$, and $\gamma =
(L-1)/2$.

Applying these results to the specific problem at hand,
let us assume that the underlying $p$ is 0.6. With $L =
10$ and $N = 20$ or 100, we have the full distribution of
variances

$$\mathcal{P}(V; 0.24N, 10) = \frac{x^{4.5} e^{-x}}{V \, \Gamma(4.5)} \tag{30}$$

with

$$x = \frac{V}{0.048 N} . \tag{31}$$

As noted in the previous section, the peak of $\mathcal{P}$ occurs at
$v[1 - 3/L]$, which is

$$V_{peak} = 0.168 N \tag{32}$$

in this case. Although this value appears far below the
maximum allowed, $V_{abs} = 0.25N$, we should be more
careful with our original question:

How likely will $V$ exceed $V_{abs}$?

To answer this question, we consider the integral

$$\rho(p, L) \equiv \mathrm{prob}(V > V_{abs}) = \int_{N/4}^{\infty} \mathcal{P}(V; v, L) \, dV . \tag{33}$$

Using the scaled variable, $\rho$ is just

$$\rho(p, L) = \frac{1}{\Gamma(\gamma)} \int_{x_{abs}}^{\infty} x^{\gamma - 1} e^{-x} \, dx \tag{34}$$

which should be recognized as a standard $\chi^2$-probability
function^ evaluated at a particular point:

$$x_{abs} \equiv \frac{L V_{abs}}{2Np(1-p)} = \frac{\gamma + 1/2}{4p(1-p)} .$$

Note that $\rho$ is now manifestly independent of $N$, the number
of steps taken by each drunk per night. As a result,
we have written explicitly in Eqn (34) that it depends
solely on $p$ and $L$.

Applying this integral to the case at hand, we write

$$\rho(0.6, 10) = \frac{1}{\Gamma(4.5)} \int_{5.2083}^{\infty} x^{3.5} e^{-x} \, dx \tag{35}$$

and, computing the integral numerically, find

$$\rho(0.6, 10) \simeq 0.3178 . \tag{36}$$
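One possible way to carry out this numerical integration is sketched below, using a composite Simpson rule on a truncated interval. The function name, the cutoff, and the number of sub-intervals are our own assumptions; the sketch is meant for moderate $L \ge 2$ only, since the fixed cutoff would have to grow for very long data strings.

```python
from math import exp, gamma

def exceed_probability(p, n_nights, cutoff=80.0, n_intervals=20000):
    """Probability that V exceeds N/4, Eqn (34), by Simpson integration.

    The infinite upper limit is replaced by x_abs + cutoff, beyond which the
    integrand x^(gamma - 1) e^(-x) is negligible for moderate L.
    """
    g = (n_nights - 1) / 2
    x_abs = n_nights / (8 * p * (1 - p))      # equals (gamma + 1/2) / (4 p (1 - p))
    upper = x_abs + cutoff
    h = (upper - x_abs) / n_intervals

    def integrand(x):
        return x ** (g - 1) * exp(-x)

    total = integrand(x_abs) + integrand(upper)
    for k in range(1, n_intervals):
        total += (4 if k % 2 else 2) * integrand(x_abs + k * h)
    return total * h / 3 / gamma(g)

rho = exceed_probability(0.6, 10)
print(round(rho, 4))
```

Because $N$ never enters the function, the independence of the result from the number of steps per night is explicit in the code as well.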

This is an astonishingly large value! In other words, by
observing $M$ drunkards (identical ones, with 0.6 probability
of stepping to the right) for 10 nights each, about
$M/3$ of the data sets will lead to a variance exceeding the
absolute bound of $N/4$! In the equivalent scenario, if the
string of numbers in the text^4 were generated anew with
each access to the URL, then about a third of the students
would find complex $p_\sigma$'s by using $\sigma_n$, Eqn (3). Let
us remind the reader that this conclusion is independent
of $N$!

For students to appreciate these considerations better,
it suffices to set up a crude Excel spreadsheet. Assuming
$p = 0.6$ and placing the formula IF(RAND()<0.6,1,0) in,
e.g., the block A1-T12000, we can "create" the data (with
right/left steps recorded as 1/0) for $M = 1200$ drunkards,
observed on $L = 10$ nights (10 successive rows for each
drunkard), for $N = 20$ steps (columns A-T). Summing
each row into column U, we have the "data set" $\{n_{\alpha,i}\}$.
After computing the averages and variances for the numbers
in column U (in blocks of 10), the histogram function
can be invoked to provide $H(\bar{n})$ and $H(V)$. In Fig. 1,
we show the result for the latter (for a particular run,
of course). Since the peak position is expected to be at
3.36 (considerably less than the "true variance" of 4.8!),
we choose bins of width 0.4, so that the center of one bin
(3.4) approximately coincides with the peak. As we see
from the histogram, the peak frequency indeed lies in the
appropriate bin. In Fig. 1, we also plot the theoretical
distribution, Eqn (30), multiplied by $M = 1200$. The
zero-parameter "fit" is clearly excellent. Since the peak
is quite far from the average, we have also computed the
latter and find $V_{av} = 4.331$. This value is entirely consistent
with the predicted (0.24)(20)(0.9) = 4.32, its approximate
location being indicated by an arrow in Fig. 1.
Lastly, we program the spreadsheet to find the number
of $V$'s lying above the bound of 5 in this sample. The
result for this particular run was 374, again consistent
with the predicted 381 = (0.3178)(1200).
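The same numerical experiment can be run outside a spreadsheet. The Python sketch below is our own rough equivalent, not the procedure used for Fig. 1: it draws the steps with the standard `random` module (with an arbitrary fixed seed so that a run can be repeated), computes the variance for each drunkard, and counts how many exceed $N/4$. Individual runs will of course fluctuate around the predicted values.

```python
import random

def simulate_variances(n_drunkards, n_nights, n_steps, p, seed=0):
    """Return one sample variance per drunkard, as in Eqn (5)."""
    rng = random.Random(seed)
    variances = []
    for _ in range(n_drunkards):
        # n_{alpha,i}: number of right steps on each of the L nights
        counts = [sum(rng.random() < p for _ in range(n_steps))
                  for _ in range(n_nights)]
        mean = sum(counts) / n_nights
        variances.append(sum(n * n for n in counts) / n_nights - mean ** 2)
    return variances

M, L, N, p = 1200, 10, 20, 0.6
V = simulate_variances(M, L, N, p)
V_av = sum(V) / M
n_exceed = sum(value > N / 4 for value in V)

# Histogram with bins of width 0.4, so that one bin is centred on 3.4.
width = 0.4
histogram = {}
for value in V:
    k = int(value // width)
    histogram[k] = histogram.get(k, 0) + 1
peak_bin = max(histogram, key=histogram.get)

print("average variance:", round(V_av, 3))
print("number above N/4:", n_exceed, "of", M)
print("centre of most populated bin:", round((peak_bin + 0.5) * width, 1))
```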



FIG. 1. Histogram (circles, 1200 runs) and analytic form (solid line)
for the distribution of variances. The average variance, $V_{av} = 4.3$,
is indicated by the arrow. The data are "observations" of 1200
drunkards for 10 nights, each taking 20 steps each night.

Given this large percentage of obviously nonsensical
estimates, a conscientious observer would naturally attempt
to improve the situation by making more observations.
Noting that this difficulty is independent of $N$
(the number of steps tallied on each night), our observer
might try increasing $L$, by watching the drunks for more
than 10 nights. One might hope that taking data for 100
nights, say, should surely lead to a much larger proportion
of "good data" (i.e., variances that lead to real estimates
for $p$). Mustering extreme patience, our observer
collects the necessary data, performs the analysis and is
absolutely shocked to discover the result: instead of dropping
sharply, the probability $\rho$ of exceeding the bound has
increased even further, to 0.3416! In fact, a little thought
shows that $\rho$ cannot be a monotonically decreasing function
of $L$. Indeed, for $L = 1$, $\rho(p, 1)$ vanishes, since
$\mathcal{P} = \delta(V)$. So, at least initially, $\rho(0.6, L)$
increases with $L$. In fact, it peaks at $L = 41$ where it
takes its maximum value of 0.35553. Beyond $L = 41$,
it decreases, but so slowly that $\rho(0.6, 10^2) = 0.3416$ is
still larger than $\rho(0.6, 10) = 0.3178$. Indeed, it decreases
exceedingly slowly, so that, e.g., $\rho(0.6, 10^3) = 17\%$!



FIG. 2. The probability $\rho(p, L)$ that the variance will exceed
the absolute bound $N/4$, plotted vs $L$ (logarithmic axis, from 1
to 1000) for various $p$. The curves, from the top, are for
$p = 0.5$, 0.6, 0.7, and 0.8, respectively.

Since the exceedance probability $\rho(p, L)$ is identically zero
at $L = 1$ (for any $p$), while $\rho(p, L \to \infty)$ is expected to
vanish, an interesting question is the following. Given $p$,
for what value of $L$ will $\rho$ reach its maximum? Though
easy to pose, we have not found a simple analytic answer
to this question. Instead, in Fig. 2, we present plots of
$\rho(p, L)$ for a few $p$'s, so that our readers can appreciate
its general behavior. Let us end by simply stating another
remarkable result: for the unbiased walk, $\rho(0.5, L)$ is a
monotonically increasing function of $L$, with limiting value
of 1/2. In other words, as the number of observation nights
is increased, the likelihood of the computed variance
exceeding the true one increases. In the limit of compiling
data from "infinitely many" nights (of $N$-step walks), it is
equally likely for the variance to exceed $N/4$ (the true value)
as otherwise! For students interested in critical phenomena,
it is possible to regard $(p, L) = (0.5, \infty)$ as a critical point
and to seek a scaling function for $\rho$ involving $(p - 0.5) L^{1/2}$.
A full analysis of these issues would be more suitable, we
believe, for the readership of another journal.
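The dependence on $L$ is easy to explore numerically. The sketch below evaluates the regularized upper incomplete gamma function through the standard power series for its lower counterpart, which remains usable for the large arguments that arise at $L = 1000$; the function names and the scan range are our own choices, and the routine is a sketch for illustration rather than a general-purpose implementation.

```python
from math import exp, lgamma, log

def upper_regularized_gamma(a, x):
    """Q(a, x) = 1 - P(a, x), using the series
    P(a, x) = x^a e^(-x) / Gamma(a) * sum_n x^n / [a (a+1) ... (a+n)]."""
    term = 1.0 / a
    total = term
    n = 0
    while term > 1e-17 * total:
        n += 1
        term *= x / (a + n)
        total += term
    lower = total * exp(a * log(x) - x - lgamma(a))
    return max(0.0, 1.0 - lower)              # clamp tiny negative round-off

def rho(p, n_nights):
    """Probability that the sample variance exceeds N/4 (independent of N)."""
    if n_nights == 1:
        return 0.0                            # a single number has zero variance
    g = (n_nights - 1) / 2
    x_abs = n_nights / (8 * p * (1 - p))
    return upper_regularized_gamma(g, x_abs)

for p in (0.5, 0.6, 0.7, 0.8):
    print(p, [round(rho(p, L), 4) for L in (2, 10, 100, 1000)])

# Scan L to locate the maximum for p = 0.6.
best_L = max(range(1, 301), key=lambda L: rho(0.6, L))
print("maximum for p = 0.6 at L =", best_L, "with rho =", round(rho(0.6, best_L), 5))
```

Repeating the scan for other values of $p$ gives a quick impression of how the position of the maximum moves as $p$ approaches 0.5.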



IV. CONCLUDING REMARKS 

We have analyzed an interesting problem posed in the
manuscript of Gould and Tobochnik^4. A random walker
is observed for 10 nights, taking 20 steps each night. The
number of right steps taken each night is recorded, and
the student is asked to compute the average and variance
from, and for, these 10 data points. Letting $p$ denote the
probability of the walker taking a right step, the true
variance is just $20p(1-p)$ and can never exceed 5. Nevertheless,
this bound is exceeded for the specific sequence
of ten numbers in the text! In an effort to resolve this
surprising result, we find that, if many such sequences
(of ten numbers) were generated according to $p = 0.6$,
nearly 32% of them will lead to a variance exceeding the
absolute bound. Indeed, we found the full distribution
for the variance, $\mathcal{P}(V; L, N, p)$, for the general case ($L$
nights, $N$ steps, arbitrary $p$).

There is an alternative perspective to the behavior discussed
here, namely, in terms of the statistics associated
with "binning." After all, if each step of the random
walker is independent, then - instead of binning the data
into $L$ nights of $N$ steps each - we might as well consider
the whole data set as a single string of $NL$ steps.
Associated with this string is, of course, just a single
number: $n$, the total number of right steps. Seeking, as
before, the "average" and "variance" of this long string,
the "average" is simply $n$, giving just one estimate for
$p$, i.e., $n/NL$. Moreover, the "variance" is necessarily
zero. Now, by observing $M$ drunkards, we can make a
histogram of the associated estimates for $p$. For large $M$,
this histogram will approach the theoretical distribution
of a binomial associated with $\binom{NL}{n}$. If $NL$ is also large,
the histogram is well approximated by a Gaussian centered
on $p$ with standard deviation $\sqrt{p(1-p)/NL}$. To
write this string in the setting of the textbook problem,
we would bin the $NL$ steps into $L$ bins and arrive at $L$
numbers $n_i$, as well as $L$ estimates (for $p$): $n_i/N$. The
average of these estimates is exactly $n/NL$, i.e., as if no
binning took place. However, by binning, we have created
a non-trivial "variance," associated with the binned
values $n_i$! Since it will depend on the bin size (or equivalently,
the number of bins: $L$), we should denote such a
variance by $V_L$. Above, we have presented its distribution
and its non-trivial dependence on $L$.
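The binning perspective can be made concrete with a small sketch: one fixed string of $NL = 200$ steps is cut into different numbers of bins, and the estimate for $p$ and the binned variance $V_L$ are recomputed each time. The string is generated here with an arbitrary seed and $p = 0.6$; the helper names are hypothetical.

```python
import random

def binned_statistics(steps, n_bins):
    """Cut a 0/1 step string into n_bins equal bins; return (p estimate, V_L)."""
    bin_size = len(steps) // n_bins
    counts = [sum(steps[k * bin_size:(k + 1) * bin_size]) for k in range(n_bins)]
    # Average of the per-bin estimates n_i / (bin size)
    p_estimate = sum(n / bin_size for n in counts) / n_bins
    mean = sum(counts) / n_bins
    variance = sum(n * n for n in counts) / n_bins - mean ** 2
    return p_estimate, variance

rng = random.Random(1)
steps = [1 if rng.random() < 0.6 else 0 for _ in range(200)]

results = {n_bins: binned_statistics(steps, n_bins) for n_bins in (1, 2, 5, 10, 20)}
for n_bins, (p_estimate, variance) in results.items():
    print(n_bins, "bins: p estimate =", round(p_estimate, 4),
          " V_L =", round(variance, 4))
```

The estimate for $p$ is the same for every choice of binning, while $V_L$ vanishes for a single bin and changes with the number of bins.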

Our goal in this article is to demonstrate that the full
distribution of a physical quantity often carries extremely
interesting information and can be invoked to put surprising
properties of individual moments into perspective.
We believe that this message is central to statistical
mechanics and should be taught within the context of
upper level courses. Of course, the notion of full distributions
is frequently included in texts but typically limited
to the binomial and Gaussian distributions. In the
context of the random walk and related problems, these
lead naturally to a discussion of the central limit theorem.
This theorem elevates the Gaussian to the status
of a "universal" distribution. The appeal of this notion
is so strong that students might be lulled into thinking
that all distributions are "normal" or "bell shaped," for
which only average and variance are needed. Yet, many
distributions are not normal and display a variety of
interesting properties. A good example is the Poisson
distribution, included in GT^4 to convince readers that they
need not be afraid of flying. For these reasons, we believe
that it would be valuable to devote some lessons
in a course on statistical mechanics, or an entire section
of a graduate or an undergraduate text, to distributions.
In particular, the notion of derived, or induced, distributions
is very important: In many areas of physics, we
are frequently interested in stochastic quantities which
are themselves specific functions of other, underlying random
variables (often assumed to be Gaussian). Of course,
finding the full distribution of such quantities can turn
into a very challenging task but the rewards can be considerable,
as we seek to demonstrate with two examples,
both of which are relatives of the simple one-dimensional
random walk. The first, concerning the statistical widths
of a one-dimensional interface, leads to a universal (but
non-Gaussian) distribution, describing a whole class of
interface models. The second illustrates the observation
that, depending on the question posed, even simple random
walks can give rise to non-analyticities which may
be difficult to explore fully.

Statistical widths of a one-dimensional interface, or average
deviations in a random walk. The simple random
walk on a line is just a string of R and L steps. Plotting
the displacement on the $y$-axis and the number $i$
of steps taken on the abscissa, a particular walk can be
viewed as a specific configuration of a one-dimensional
interface, embedded in a two-dimensional bulk: the interface
height $h_i$ is measured along $y$ vs the "column label"
$i = 1, 2, \ldots, N$. In the language of interfaces, two natural
and frequently considered quantities are the average
height $\bar{h} \equiv \sum_i h_i/N$ and width (easily recognized as a
standard deviation) $w \equiv \sqrt{\sum_i (h_i - \bar{h})^2/N}$ of a given
interface. However, in the context of random walks, neither
of these quantities comes to mind easily. Instead of
being an ensemble average over very many walks, $\bar{h}$ is
the average displacement over time, of a specific walk in
a specific night, and hence depends on the entire history
of this particular walk. Similarly, $w$ is different from one
interface to another and, in terms of the drunkard, typically
changes from one night to the next. Here, we are
interested in the full distributions of $\bar{h}$ and $w$. Obviously,
we can compile their histograms, $H(\bar{h})$ and $H(w)$, by
generating many interface configurations or by observing
the drunkard over many nights. If the underlying
probability is symmetric ($p(\pm 1) = 1/2$), then it is easy
to imagine that $H(\bar{h})$ is symmetric and approaches a
Gaussian, thanks to the central limit theorem. On the
other hand, it is not so easy to guess what $H(w)$ should
be. First, $w$ is never negative, so $H(w)$ must "end" at
$w = 0$. Second, though the average (and perhaps the
peak position) should increase with $\sqrt{N}$, the "top end"
of $H(w)$ will be $O(N)$. Referring to Ref. 8 for details,
we just summarize the key results. Noting that $\langle w^2 \rangle$, the
width-square averaged over all possible configurations, is
just $L/12$, we obtain $H(w) \sim \exp[-\pi^2 w^2/\langle w^2 \rangle]$ for
large widths and $H(w) \sim \exp[-3\langle w^2 \rangle/2w^2]$ for
small $w$'s. In other words, configurations with widths
significantly larger/smaller than $\langle w^2 \rangle$ are exponentially
suppressed. Remarkably, this distribution is universal,
describing a whole class of interface models, in the same
sense that the Gaussian is a limiting distribution for sums
of random numbers.
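To make the two definitions concrete, the sketch below converts a string of R and L steps (encoded here, by our own convention, as +1 and -1) into heights $h_i$ and evaluates $\bar{h}$ and $w$ for that single walk. Compiling $H(\bar{h})$ and $H(w)$ would amount to calling this function on many randomly generated walks.

```python
from math import sqrt

def height_and_width(steps):
    """Return (mean height, width) of the interface defined by one walk.

    steps is a sequence of +1 (right) and -1 (left); h_i is the displacement
    after i steps, for i = 1, ..., N.
    """
    heights = []
    position = 0
    for step in steps:
        position += step
        heights.append(position)
    n = len(heights)
    h_bar = sum(heights) / n
    w = sqrt(sum((h - h_bar) ** 2 for h in heights) / n)
    return h_bar, w

# Example: R, R, L, L gives heights 1, 2, 1, 0.
h_bar, w = height_and_width([+1, +1, -1, -1])
print(h_bar, w)
```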

Distribution of longest returns. Here, we turn to an
extremely interesting example, in which there is no analytic
solution even for simple one-dimensional random
walks. Motivated by the physics of charged polymers
(a string of monomers, each carrying charge $\pm 1$), Ertas
and Kantor^9 explored the properties of the largest
neutral segments. Translated into the language of
one-dimensional random walks, a neutral segment, consisting
of equal numbers of opposite charges, corresponds to a
part of the walk where the walker returns to a particular
site. So, the "largest neutral segment" maps into
the "longest return path," or largest number of steps between
visits to the same site (regardless of which site). A
more explicit phrasing of the question is: "Of all the $2^N$
random walks of $N$ steps on a line, how many have $N'$
(with $0 \le N' \le N$) as the longest return path?" Denoting
the answer by $H(N'; N)$, the normalized distribution
is $P(N') \equiv H(N'; N)/2^N$. Again, it may be helpful to
consider the extremes. For $N' = 0$, there are precisely
two walks: all right or all left steps. So, $H(0; N) = 2$.
At the other extreme, for $N' = N$, these walks are the
familiar ones which return to the starting point, so that
$H(N; N) = \binom{N}{N/2}$. In the limit of large $N$, it is better
to use the fraction $\phi \equiv N'/N$ as a variable and to take
the continuum limit, so that an appropriate probability
density, $p(\phi)$, emerges from $P(N')$. Remarkably, $p(\phi)$
develops a kink, i.e., a discontinuous first derivative, at
$\phi = 1/2$ (Fig. 3 in Ref. 9)! Despite the simple sounding
nature of this question, it is clearly quite a complex
issue. Though some understanding of this unexpected
phenomenon is possible, it is far from being completely
solved.
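For small $N$, the counting problem can be settled by exhaustive enumeration, which also confirms the two extreme values quoted above. The sketch below is our own illustration (with $N = 10$, so that only $2^{10}$ walks need to be listed); it is not the method of Ref. 9.

```python
from itertools import product
from math import comb

def longest_return(steps):
    """Largest number of steps between two visits to the same site."""
    first_visit = {0: 0}          # site -> time of first visit (start at the origin)
    position = 0
    longest = 0
    for time, step in enumerate(steps, start=1):
        position += step
        if position in first_visit:
            longest = max(longest, time - first_visit[position])
        else:
            first_visit[position] = time
    return longest

def return_histogram(n_steps):
    """H(N'; N): number of N-step walks whose longest return path is N'."""
    histogram = [0] * (n_steps + 1)
    for steps in product((+1, -1), repeat=n_steps):
        histogram[longest_return(steps)] += 1
    return histogram

N = 10
H = return_histogram(N)
print(H)
print("H(0; N) =", H[0], "  H(N; N) =", H[N], "  binomial(N, N/2) =", comb(N, N // 2))
```

Since every return takes an even number of steps, the entries for odd $N'$ are zero.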

To conclude, we hope to have motivated teachers and
students alike to think beyond the first few moments and
to focus on full distributions, if at all possible. To whet
our readers' appetites, we have presented a particularly
striking example in the context of simple random walks:
if one uses finite data strings to estimate the (asymptotic)
probability $p$, the estimate can easily turn out to
be manifestly nonsensical, namely, complex! Beyond this
"demonstration," we briefly discussed two further examples,
both associated with random walks. Hopefully, these
concepts will challenge our readers to explore, or discover,
their own favorite distributions.